Overview
Brought to you by YData
Dataset statistics
| Number of variables | 141 |
|---|---|
| Number of observations | 707 |
| Missing cells | 83352 |
| Missing cells (%) | 83.6% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 778.9 KiB |
| Average record size in memory | 1.1 KiB |
Variable types
| Categorical | 19 |
|---|---|
| Text | 4 |
| Boolean | 4 |
| Numeric | 10 |
| Unsupported | 104 |
body_site has constant value "stool" | Constant |
antibiotics_current_use has constant value "False" | Constant |
non_westernized has constant value "False" | Constant |
sequencing_platform has constant value "IlluminaHiSeq" | Constant |
population has constant value "Dutch" | Constant |
BMI is highly overall correlated with DNA_extraction_kit and 3 other fields | High correlation |
DNA_extraction_kit is highly overall correlated with BMI and 25 other fields | High correlation |
PMID is highly overall correlated with BMI and 25 other fields | High correlation |
age is highly overall correlated with DNA_extraction_kit and 4 other fields | High correlation |
age_category is highly overall correlated with DNA_extraction_kit and 18 other fields | High correlation |
birth_weight is highly overall correlated with DNA_extraction_kit and 11 other fields | High correlation |
born_method is highly overall correlated with DNA_extraction_kit and 9 other fields | High correlation |
breastfeeding_duration is highly overall correlated with DNA_extraction_kit and 9 other fields | High correlation |
country is highly overall correlated with DNA_extraction_kit and 21 other fields | High correlation |
curator is highly overall correlated with BMI and 25 other fields | High correlation |
days_from_first_collection is highly overall correlated with DNA_extraction_kit and 10 other fields | High correlation |
disease is highly overall correlated with DNA_extraction_kit and 19 other fields | High correlation |
disease_subtype is highly overall correlated with DNA_extraction_kit and 9 other fields | High correlation |
family_role is highly overall correlated with DNA_extraction_kit and 15 other fields | High correlation |
feeding_practice is highly overall correlated with DNA_extraction_kit and 11 other fields | High correlation |
fobt is highly overall correlated with DNA_extraction_kit and 4 other fields | High correlation |
formula_first_day is highly overall correlated with DNA_extraction_kit and 10 other fields | High correlation |
gender is highly overall correlated with birth_weight | High correlation |
gestational_age is highly overall correlated with DNA_extraction_kit and 9 other fields | High correlation |
infant_age is highly overall correlated with DNA_extraction_kit and 10 other fields | High correlation |
location is highly overall correlated with DNA_extraction_kit and 7 other fields | High correlation |
median_read_length is highly overall correlated with DNA_extraction_kit and 6 other fields | High correlation |
minimum_read_length is highly overall correlated with DNA_extraction_kit and 13 other fields | High correlation |
number_bases is highly overall correlated with DNA_extraction_kit and 5 other fields | High correlation |
number_reads is highly overall correlated with DNA_extraction_kit and 6 other fields | High correlation |
pregnant is highly overall correlated with DNA_extraction_kit and 14 other fields | High correlation |
study_condition is highly overall correlated with DNA_extraction_kit and 19 other fields | High correlation |
study_name is highly overall correlated with BMI and 25 other fields | High correlation |
median_read_length is highly imbalanced (73.4%) | Imbalance |
antibiotics_current_use has 626 (88.5%) missing values | Missing |
age has 626 (88.5%) missing values | Missing |
infant_age has 552 (78.1%) missing values | Missing |
NCBI_accession has 355 (50.2%) missing values | Missing |
pregnant has 436 (61.7%) missing values | Missing |
lactating has 707 (100.0%) missing values | Missing |
BMI has 627 (88.7%) missing values | Missing |
family has 436 (61.7%) missing values | Missing |
treatment has 707 (100.0%) missing values | Missing |
days_from_first_collection has 436 (61.7%) missing values | Missing |
family_role has 436 (61.7%) missing values | Missing |
born_method has 552 (78.1%) missing values | Missing |
feeding_practice has 556 (78.6%) missing values | Missing |
location has 626 (88.5%) missing values | Missing |
diet has 707 (100.0%) missing values | Missing |
travel_destination has 707 (100.0%) missing values | Missing |
visit_number has 707 (100.0%) missing values | Missing |
premature has 707 (100.0%) missing values | Missing |
birth_weight has 552 (78.1%) missing values | Missing |
gestational_age has 552 (78.1%) missing values | Missing |
antibiotics_family has 707 (100.0%) missing values | Missing |
disease_subtype has 352 (49.8%) missing values | Missing |
days_after_onset has 707 (100.0%) missing values | Missing |
creatine has 707 (100.0%) missing values | Missing |
albumine has 707 (100.0%) missing values | Missing |
hscrp has 707 (100.0%) missing values | Missing |
ESR has 707 (100.0%) missing values | Missing |
ast has 707 (100.0%) missing values | Missing |
alt has 707 (100.0%) missing values | Missing |
globulin has 707 (100.0%) missing values | Missing |
urea_nitrogen has 707 (100.0%) missing values | Missing |
BASDAI has 707 (100.0%) missing values | Missing |
BASFI has 707 (100.0%) missing values | Missing |
alcohol has 707 (100.0%) missing values | Missing |
flg_genotype has 707 (100.0%) missing values | Missing |
population has 352 (49.8%) missing values | Missing |
menopausal_status has 707 (100.0%) missing values | Missing |
lifestyle has 707 (100.0%) missing values | Missing |
body_subsite has 707 (100.0%) missing values | Missing |
uncurated_metadata has 707 (100.0%) missing values | Missing |
tnm has 707 (100.0%) missing values | Missing |
triglycerides has 707 (100.0%) missing values | Missing |
hdl has 707 (100.0%) missing values | Missing |
ldl has 707 (100.0%) missing values | Missing |
hba1c has 707 (100.0%) missing values | Missing |
change_in_tumor_size has 707 (100.0%) missing values | Missing |
RECIST has 707 (100.0%) missing values | Missing |
ORR has 707 (100.0%) missing values | Missing |
smoker has 707 (100.0%) missing values | Missing |
ever_smoker has 707 (100.0%) missing values | Missing |
dental_sample_type has 707 (100.0%) missing values | Missing |
history_of_periodontitis has 707 (100.0%) missing values | Missing |
PPD_M has 707 (100.0%) missing values | Missing |
PPD_B has 707 (100.0%) missing values | Missing |
PPD_D has 707 (100.0%) missing values | Missing |
PPD_L has 707 (100.0%) missing values | Missing |
fobt has 626 (88.5%) missing values | Missing |
disease_stage has 707 (100.0%) missing values | Missing |
disease_location has 707 (100.0%) missing values | Missing |
calprotectin has 707 (100.0%) missing values | Missing |
HBI has 707 (100.0%) missing values | Missing |
SCCAI has 707 (100.0%) missing values | Missing |
mumps has 707 (100.0%) missing values | Missing |
cholesterol has 707 (100.0%) missing values | Missing |
c_peptide has 707 (100.0%) missing values | Missing |
glucose has 707 (100.0%) missing values | Missing |
creatinine has 707 (100.0%) missing values | Missing |
bilubirin has 707 (100.0%) missing values | Missing |
prothrombin_time has 707 (100.0%) missing values | Missing |
wbc has 707 (100.0%) missing values | Missing |
rbc has 707 (100.0%) missing values | Missing |
hemoglobinometry has 707 (100.0%) missing values | Missing |
FMT_role has 707 (100.0%) missing values | Missing |
subcohort has 707 (100.0%) missing values | Missing |
fmt_id has 707 (100.0%) missing values | Missing |
remission has 707 (100.0%) missing values | Missing |
dyastolic_p has 707 (100.0%) missing values | Missing |
systolic_p has 707 (100.0%) missing values | Missing |
insulin_cat has 707 (100.0%) missing values | Missing |
adiponectin has 707 (100.0%) missing values | Missing |
glp_1 has 707 (100.0%) missing values | Missing |
cd163 has 707 (100.0%) missing values | Missing |
il_1 has 707 (100.0%) missing values | Missing |
leptin has 707 (100.0%) missing values | Missing |
fgf_19 has 707 (100.0%) missing values | Missing |
glutamate_decarboxylase_2_antibody has 707 (100.0%) missing values | Missing |
HLA has 707 (100.0%) missing values | Missing |
autoantibody_positive has 707 (100.0%) missing values | Missing |
age_seroconversion has 707 (100.0%) missing values | Missing |
age_T1D_diagnosis has 707 (100.0%) missing values | Missing |
hitchip_probe_class has 707 (100.0%) missing values | Missing |
previous_therapy has 707 (100.0%) missing values | Missing |
performance_status has 707 (100.0%) missing values | Missing |
toxicity_above_zero has 707 (100.0%) missing values | Missing |
PFS12 has 707 (100.0%) missing values | Missing |
fasting_insulin has 707 (100.0%) missing values | Missing |
fasting_glucose has 707 (100.0%) missing values | Missing |
protein_intake has 707 (100.0%) missing values | Missing |
stec_count has 707 (100.0%) missing values | Missing |
shigatoxin_2_elisa has 707 (100.0%) missing values | Missing |
stool_texture has 707 (100.0%) missing values | Missing |
anti_PD_1 has 707 (100.0%) missing values | Missing |
ajcc has 707 (100.0%) missing values | Missing |
smoke has 707 (100.0%) missing values | Missing |
bristol_score has 707 (100.0%) missing values | Missing |
hsCRP has 707 (100.0%) missing values | Missing |
LDL has 707 (100.0%) missing values | Missing |
mgs_richness has 707 (100.0%) missing values | Missing |
ferm_milk_prod_consumer has 707 (100.0%) missing values | Missing |
inr has 707 (100.0%) missing values | Missing |
birth_control_pil has 707 (100.0%) missing values | Missing |
c_section_type has 707 (100.0%) missing values | Missing |
hla_drb12 has 707 (100.0%) missing values | Missing |
hla_dqa12 has 707 (100.0%) missing values | Missing |
hla_dqa11 has 707 (100.0%) missing values | Missing |
hla_drb11 has 707 (100.0%) missing values | Missing |
zigosity has 707 (100.0%) missing values | Missing |
brinkman_index has 707 (100.0%) missing values | Missing |
alcohol_numeric has 707 (100.0%) missing values | Missing |
breastfeeding_duration has 571 (80.8%) missing values | Missing |
formula_first_day has 555 (78.5%) missing values | Missing |
ALT has 707 (100.0%) missing values | Missing |
eGFR has 707 (100.0%) missing values | Missing |
sample_id has unique values | Unique |
number_reads has unique values | Unique |
number_bases has unique values | Unique |
lactating is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
treatment is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
diet is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
travel_destination is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
visit_number is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
premature is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
antibiotics_family is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
days_after_onset is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
creatine is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
albumine is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
hscrp is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
ESR is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
ast is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
alt is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
globulin is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
urea_nitrogen is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
BASDAI is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
BASFI is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
alcohol is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
flg_genotype is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
menopausal_status is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
lifestyle is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
body_subsite is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
uncurated_metadata is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
tnm is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
triglycerides is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
hdl is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
ldl is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
hba1c is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
change_in_tumor_size is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
RECIST is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
ORR is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
smoker is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
ever_smoker is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
dental_sample_type is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
history_of_periodontitis is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
PPD_M is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
PPD_B is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
PPD_D is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
PPD_L is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
disease_stage is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
disease_location is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
calprotectin is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
HBI is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
SCCAI is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
mumps is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
cholesterol is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
c_peptide is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
glucose is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
creatinine is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
bilubirin is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
prothrombin_time is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
wbc is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
rbc is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
hemoglobinometry is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
FMT_role is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
subcohort is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
fmt_id is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
remission is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
dyastolic_p is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
systolic_p is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
insulin_cat is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
adiponectin is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
glp_1 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
cd163 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
il_1 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
leptin is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
fgf_19 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
glutamate_decarboxylase_2_antibody is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
HLA is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
autoantibody_positive is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
age_seroconversion is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
age_T1D_diagnosis is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
hitchip_probe_class is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
previous_therapy is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
performance_status is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
toxicity_above_zero is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
PFS12 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
fasting_insulin is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
fasting_glucose is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
protein_intake is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
stec_count is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
shigatoxin_2_elisa is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
stool_texture is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
anti_PD_1 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
ajcc is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
smoke is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
bristol_score is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
hsCRP is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
LDL is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
mgs_richness is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
ferm_milk_prod_consumer is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
inr is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
birth_control_pil is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
c_section_type is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
hla_drb12 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
hla_dqa12 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
hla_dqa11 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
hla_drb11 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
zigosity is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
brinkman_index is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
alcohol_numeric is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
ALT is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
eGFR is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
days_from_first_collection has 69 (9.8%) zeros | Zeros |
Reproduction
| Analysis started | 2025-03-30 01:12:46.396837 |
|---|---|
| Analysis finished | 2025-03-30 01:12:55.540996 |
| Duration | 9.14 seconds |
| Software version | ydata-profiling vv4.16.1 |
| Download configuration | config.json |
Variables
study_name
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.7 KiB |
| VilaAV_2018 | |
|---|---|
| YassourM_2018 | |
| HanniganGD_2017 |
Length
| Max length | 15 |
|---|---|
| Median length | 11 |
| Mean length | 12.224894 |
| Min length | 11 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | HanniganGD_2017 |
|---|---|
| 2nd row | HanniganGD_2017 |
| 3rd row | HanniganGD_2017 |
| 4th row | HanniganGD_2017 |
| 5th row | HanniganGD_2017 |
Common Values
| Value | Count | Frequency (%) |
| VilaAV_2018 | 355 | |
| YassourM_2018 | 271 | |
| HanniganGD_2017 | 81 | 11.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| vilaav_2018 | 355 | |
| yassourm_2018 | 271 | |
| hannigangd_2017 | 81 | 11.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 788 | 9.1% |
| V | 710 | 8.2% |
| 0 | 707 | 8.2% |
| 1 | 707 | 8.2% |
| _ | 707 | 8.2% |
| 2 | 707 | 8.2% |
| 8 | 626 | 7.2% |
| s | 542 | 6.3% |
| i | 436 | 5.0% |
| A | 355 | 4.1% |
| Other values (12) | 2358 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 8643 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 788 | 9.1% |
| V | 710 | 8.2% |
| 0 | 707 | 8.2% |
| 1 | 707 | 8.2% |
| _ | 707 | 8.2% |
| 2 | 707 | 8.2% |
| 8 | 626 | 7.2% |
| s | 542 | 6.3% |
| i | 436 | 5.0% |
| A | 355 | 4.1% |
| Other values (12) | 2358 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 8643 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 788 | 9.1% |
| V | 710 | 8.2% |
| 0 | 707 | 8.2% |
| 1 | 707 | 8.2% |
| _ | 707 | 8.2% |
| 2 | 707 | 8.2% |
| 8 | 626 | 7.2% |
| s | 542 | 6.3% |
| i | 436 | 5.0% |
| A | 355 | 4.1% |
| Other values (12) | 2358 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 8643 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 788 | 9.1% |
| V | 710 | 8.2% |
| 0 | 707 | 8.2% |
| 1 | 707 | 8.2% |
| _ | 707 | 8.2% |
| 2 | 707 | 8.2% |
| 8 | 626 | 7.2% |
| s | 542 | 6.3% |
| i | 436 | 5.0% |
| A | 355 | 4.1% |
| Other values (12) | 2358 |
sample_id
Text
Unique 
| Distinct | 707 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.7 KiB |
Length
| Max length | 28 |
|---|---|
| Median length | 28 |
| Mean length | 17.659123 |
| Min length | 7 |
Unique
| Unique | 707 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | MG100208 |
|---|---|
| 2nd row | MG100207 |
| 3rd row | MG100206 |
| 4th row | MG100205 |
| 5th row | MG100204 |
| Value | Count | Frequency (%) |
| mg100208 | 1 | 0.1% |
| egar00001763476_1000ibd00202 | 1 | 0.1% |
| mg100198 | 1 | 0.1% |
| mg100206 | 1 | 0.1% |
| mg100205 | 1 | 0.1% |
| mg100204 | 1 | 0.1% |
| mg100203 | 1 | 0.1% |
| mg100202 | 1 | 0.1% |
| mg100201 | 1 | 0.1% |
| mg100200 | 1 | 0.1% |
| Other values (697) | 697 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3903 | |
| 1 | 1392 | 11.1% |
| 3 | 771 | 6.2% |
| 6 | 744 | 6.0% |
| G | 707 | 5.7% |
| 7 | 662 | 5.3% |
| 2 | 552 | 4.4% |
| 4 | 429 | 3.4% |
| D | 355 | 2.8% |
| _ | 355 | 2.8% |
| Other values (9) | 2615 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 12485 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3903 | |
| 1 | 1392 | 11.1% |
| 3 | 771 | 6.2% |
| 6 | 744 | 6.0% |
| G | 707 | 5.7% |
| 7 | 662 | 5.3% |
| 2 | 552 | 4.4% |
| 4 | 429 | 3.4% |
| D | 355 | 2.8% |
| _ | 355 | 2.8% |
| Other values (9) | 2615 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 12485 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3903 | |
| 1 | 1392 | 11.1% |
| 3 | 771 | 6.2% |
| 6 | 744 | 6.0% |
| G | 707 | 5.7% |
| 7 | 662 | 5.3% |
| 2 | 552 | 4.4% |
| 4 | 429 | 3.4% |
| D | 355 | 2.8% |
| _ | 355 | 2.8% |
| Other values (9) | 2615 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 12485 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3903 | |
| 1 | 1392 | 11.1% |
| 3 | 771 | 6.2% |
| 6 | 744 | 6.0% |
| G | 707 | 5.7% |
| 7 | 662 | 5.3% |
| 2 | 552 | 4.4% |
| 4 | 429 | 3.4% |
| D | 355 | 2.8% |
| _ | 355 | 2.8% |
| Other values (9) | 2615 |
subject_id
Text
| Distinct | 516 |
|---|---|
| Distinct (%) | 73.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.7 KiB |
Length
| Max length | 19 |
|---|---|
| Median length | 16 |
| Mean length | 12.510608 |
| Min length | 6 |
Unique
| Unique | 440 ? |
|---|---|
| Unique (%) | 62.2% |
Sample
| 1st row | HanniganGD_2017_A29 |
|---|---|
| 2nd row | HanniganGD_2017_A28 |
| 3rd row | HanniganGD_2017_A27 |
| 4th row | HanniganGD_2017_A26 |
| 5th row | HanniganGD_2017_A25 |
| Value | Count | Frequency (%) |
| m0038c | 5 | 0.7% |
| m0072c | 5 | 0.7% |
| m0226c | 5 | 0.7% |
| m1098c | 5 | 0.7% |
| m0333c | 5 | 0.7% |
| m0388c | 5 | 0.7% |
| m0399c | 5 | 0.7% |
| m0346c | 5 | 0.7% |
| m0201c | 5 | 0.7% |
| m0327c | 5 | 0.7% |
| Other values (506) | 657 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2400 | |
| 1 | 661 | 7.5% |
| _ | 517 | 5.8% |
| D | 436 | 4.9% |
| M | 387 | 4.4% |
| s | 355 | 4.0% |
| B | 355 | 4.0% |
| I | 355 | 4.0% |
| b | 355 | 4.0% |
| u | 355 | 4.0% |
| Other values (16) | 2669 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 8845 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2400 | |
| 1 | 661 | 7.5% |
| _ | 517 | 5.8% |
| D | 436 | 4.9% |
| M | 387 | 4.4% |
| s | 355 | 4.0% |
| B | 355 | 4.0% |
| I | 355 | 4.0% |
| b | 355 | 4.0% |
| u | 355 | 4.0% |
| Other values (16) | 2669 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 8845 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2400 | |
| 1 | 661 | 7.5% |
| _ | 517 | 5.8% |
| D | 436 | 4.9% |
| M | 387 | 4.4% |
| s | 355 | 4.0% |
| B | 355 | 4.0% |
| I | 355 | 4.0% |
| b | 355 | 4.0% |
| u | 355 | 4.0% |
| Other values (16) | 2669 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 8845 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2400 | |
| 1 | 661 | 7.5% |
| _ | 517 | 5.8% |
| D | 436 | 4.9% |
| M | 387 | 4.4% |
| s | 355 | 4.0% |
| B | 355 | 4.0% |
| I | 355 | 4.0% |
| b | 355 | 4.0% |
| u | 355 | 4.0% |
| Other values (16) | 2669 |
body_site
Categorical
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.7 KiB |
| stool |
|---|
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | stool |
|---|---|
| 2nd row | stool |
| 3rd row | stool |
| 4th row | stool |
| 5th row | stool |
Common Values
| Value | Count | Frequency (%) |
| stool | 707 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| stool | 707 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 1414 | |
| s | 707 | |
| t | 707 | |
| l | 707 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3535 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 1414 | |
| s | 707 | |
| t | 707 | |
| l | 707 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3535 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 1414 | |
| s | 707 | |
| t | 707 | |
| l | 707 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3535 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 1414 | |
| s | 707 | |
| t | 707 | |
| l | 707 |
antibiotics_current_use
Boolean
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 626 |
| Missing (%) | 88.5% |
| Memory size | 1.5 KiB |
| False | |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) |
| False | 81 | 11.5% |
| (Missing) | 626 |
study_condition
Categorical
High correlation 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.7 KiB |
| IBD | |
|---|---|
| control | |
| CRC | 27 |
| adenoma | 26 |
Length
| Max length | 7 |
|---|---|
| Median length | 3 |
| Mean length | 4.8387553 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | adenoma |
|---|---|
| 2nd row | adenoma |
| 3rd row | adenoma |
| 4th row | adenoma |
| 5th row | adenoma |
Common Values
| Value | Count | Frequency (%) |
| IBD | 355 | |
| control | 299 | |
| CRC | 27 | 3.8% |
| adenoma | 26 | 3.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| ibd | 355 | |
| control | 299 | |
| crc | 27 | 3.8% |
| adenoma | 26 | 3.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 624 | |
| I | 355 | |
| B | 355 | |
| D | 355 | |
| n | 325 | |
| c | 299 | |
| t | 299 | |
| r | 299 | |
| l | 299 | |
| C | 54 | 1.6% |
| Other values (5) | 157 | 4.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3421 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 624 | |
| I | 355 | |
| B | 355 | |
| D | 355 | |
| n | 325 | |
| c | 299 | |
| t | 299 | |
| r | 299 | |
| l | 299 | |
| C | 54 | 1.6% |
| Other values (5) | 157 | 4.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3421 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 624 | |
| I | 355 | |
| B | 355 | |
| D | 355 | |
| n | 325 | |
| c | 299 | |
| t | 299 | |
| r | 299 | |
| l | 299 | |
| C | 54 | 1.6% |
| Other values (5) | 157 | 4.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3421 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 624 | |
| I | 355 | |
| B | 355 | |
| D | 355 | |
| n | 325 | |
| c | 299 | |
| t | 299 | |
| r | 299 | |
| l | 299 | |
| C | 54 | 1.6% |
| Other values (5) | 157 | 4.6% |
disease
Categorical
High correlation 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.7 KiB |
| IBD | |
|---|---|
| healthy | |
| CRC | 27 |
| adenoma | 26 |
Length
| Max length | 7 |
|---|---|
| Median length | 3 |
| Mean length | 4.8387553 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | adenoma |
|---|---|
| 2nd row | adenoma |
| 3rd row | adenoma |
| 4th row | adenoma |
| 5th row | adenoma |
Common Values
| Value | Count | Frequency (%) |
| IBD | 355 | |
| healthy | 299 | |
| CRC | 27 | 3.8% |
| adenoma | 26 | 3.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| ibd | 355 | |
| healthy | 299 | |
| crc | 27 | 3.8% |
| adenoma | 26 | 3.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| h | 598 | |
| I | 355 | |
| B | 355 | |
| D | 355 | |
| a | 351 | |
| e | 325 | |
| l | 299 | |
| t | 299 | |
| y | 299 | |
| C | 54 | 1.6% |
| Other values (5) | 131 | 3.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3421 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| h | 598 | |
| I | 355 | |
| B | 355 | |
| D | 355 | |
| a | 351 | |
| e | 325 | |
| l | 299 | |
| t | 299 | |
| y | 299 | |
| C | 54 | 1.6% |
| Other values (5) | 131 | 3.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3421 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| h | 598 | |
| I | 355 | |
| B | 355 | |
| D | 355 | |
| a | 351 | |
| e | 325 | |
| l | 299 | |
| t | 299 | |
| y | 299 | |
| C | 54 | 1.6% |
| Other values (5) | 131 | 3.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3421 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| h | 598 | |
| I | 355 | |
| B | 355 | |
| D | 355 | |
| a | 351 | |
| e | 325 | |
| l | 299 | |
| t | 299 | |
| y | 299 | |
| C | 54 | 1.6% |
| Other values (5) | 131 | 3.8% |
age
Real number (ℝ)
High correlation  Missing 
| Distinct | 38 |
|---|---|
| Distinct (%) | 46.9% |
| Missing | 626 |
| Missing (%) | 88.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 58.580247 |
| Minimum | 35 |
|---|---|
| Maximum | 88 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.7 KiB |
Quantile statistics
| Minimum | 35 |
|---|---|
| 5-th percentile | 43 |
| Q1 | 51 |
| median | 59 |
| Q3 | 65 |
| 95-th percentile | 75 |
| Maximum | 88 |
| Range | 53 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 10.793359 |
|---|---|
| Coefficient of variation (CV) | 0.18424913 |
| Kurtosis | -0.17597086 |
| Mean | 58.580247 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.16569532 |
| Sum | 4745 |
| Variance | 116.4966 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 51 | 7 | 1.0% |
| 61 | 5 | 0.7% |
| 69 | 4 | 0.6% |
| 63 | 4 | 0.6% |
| 64 | 4 | 0.6% |
| 65 | 4 | 0.6% |
| 58 | 3 | 0.4% |
| 47 | 3 | 0.4% |
| 59 | 3 | 0.4% |
| 52 | 3 | 0.4% |
| Other values (28) | 41 | 5.8% |
| (Missing) | 626 |
| Value | Count | Frequency (%) |
| 35 | 1 | 0.1% |
| 37 | 2 | |
| 42 | 1 | 0.1% |
| 43 | 2 | |
| 44 | 1 | 0.1% |
| 45 | 2 | |
| 46 | 1 | 0.1% |
| 47 | 3 | |
| 48 | 1 | 0.1% |
| 49 | 2 |
| Value | Count | Frequency (%) |
| 88 | 1 | 0.1% |
| 82 | 1 | 0.1% |
| 80 | 1 | 0.1% |
| 76 | 1 | 0.1% |
| 75 | 2 | |
| 73 | 2 | |
| 72 | 1 | 0.1% |
| 71 | 2 | |
| 70 | 1 | 0.1% |
| 69 | 4 |
infant_age
Categorical
High correlation  Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 552 |
| Missing (%) | 78.1% |
| Memory size | 5.7 KiB |
| 60.0 | |
|---|---|
| 14.0 | |
| 30.0 | |
| 90.0 | |
| 0.0 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 3.8258065 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 60.0 |
|---|---|
| 2nd row | 90.0 |
| 3rd row | 14.0 |
| 4th row | 30.0 |
| 5th row | 60.0 |
Common Values
| Value | Count | Frequency (%) |
| 60.0 | 33 | 4.7% |
| 14.0 | 32 | 4.5% |
| 30.0 | 32 | 4.5% |
| 90.0 | 31 | 4.4% |
| 0.0 | 27 | 3.8% |
| (Missing) | 552 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 60.0 | 33 | |
| 14.0 | 32 | |
| 30.0 | 32 | |
| 90.0 | 31 | |
| 0.0 | 27 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 278 | |
| . | 155 | |
| 6 | 33 | 5.6% |
| 1 | 32 | 5.4% |
| 4 | 32 | 5.4% |
| 3 | 32 | 5.4% |
| 9 | 31 | 5.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 593 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 278 | |
| . | 155 | |
| 6 | 33 | 5.6% |
| 1 | 32 | 5.4% |
| 4 | 32 | 5.4% |
| 3 | 32 | 5.4% |
| 9 | 31 | 5.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 593 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 278 | |
| . | 155 | |
| 6 | 33 | 5.6% |
| 1 | 32 | 5.4% |
| 4 | 32 | 5.4% |
| 3 | 32 | 5.4% |
| 9 | 31 | 5.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 593 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 278 | |
| . | 155 | |
| 6 | 33 | 5.6% |
| 1 | 32 | 5.4% |
| 4 | 32 | 5.4% |
| 3 | 32 | 5.4% |
| 9 | 31 | 5.2% |
age_category
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.7 KiB |
| adult | |
|---|---|
| newborn | |
| senior | 22 |
Length
| Max length | 7 |
|---|---|
| Median length | 5 |
| Mean length | 5.4695898 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | adult |
|---|---|
| 2nd row | adult |
| 3rd row | senior |
| 4th row | senior |
| 5th row | adult |
Common Values
| Value | Count | Frequency (%) |
| adult | 530 | |
| newborn | 155 | 21.9% |
| senior | 22 | 3.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| adult | 530 | |
| newborn | 155 | 21.9% |
| senior | 22 | 3.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 530 | |
| d | 530 | |
| u | 530 | |
| l | 530 | |
| t | 530 | |
| n | 332 | |
| e | 177 | 4.6% |
| o | 177 | 4.6% |
| r | 177 | 4.6% |
| w | 155 | 4.0% |
| Other values (3) | 199 | 5.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3867 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 530 | |
| d | 530 | |
| u | 530 | |
| l | 530 | |
| t | 530 | |
| n | 332 | |
| e | 177 | 4.6% |
| o | 177 | 4.6% |
| r | 177 | 4.6% |
| w | 155 | 4.0% |
| Other values (3) | 199 | 5.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3867 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 530 | |
| d | 530 | |
| u | 530 | |
| l | 530 | |
| t | 530 | |
| n | 332 | |
| e | 177 | 4.6% |
| o | 177 | 4.6% |
| r | 177 | 4.6% |
| w | 155 | 4.0% |
| Other values (3) | 199 | 5.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3867 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 530 | |
| d | 530 | |
| u | 530 | |
| l | 530 | |
| t | 530 | |
| n | 332 | |
| e | 177 | 4.6% |
| o | 177 | 4.6% |
| r | 177 | 4.6% |
| w | 155 | 4.0% |
| Other values (3) | 199 | 5.1% |
gender
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.7 KiB |
| female | |
|---|---|
| male |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.3125884 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | female |
|---|---|
| 2nd row | male |
| 3rd row | male |
| 4th row | female |
| 5th row | female |
Common Values
| Value | Count | Frequency (%) |
| female | 464 | |
| male | 243 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| female | 464 | |
| male | 243 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1171 | |
| m | 707 | |
| a | 707 | |
| l | 707 | |
| f | 464 | 12.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3756 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 1171 | |
| m | 707 | |
| a | 707 | |
| l | 707 | |
| f | 464 | 12.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3756 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 1171 | |
| m | 707 | |
| a | 707 | |
| l | 707 | |
| f | 464 | 12.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3756 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 1171 | |
| m | 707 | |
| a | 707 | |
| l | 707 | |
| f | 464 | 12.4% |
country
Categorical
High correlation 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.7 KiB |
| NLD | |
|---|---|
| FIN | |
| USA | |
| CAN | 27 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CAN |
|---|---|
| 2nd row | CAN |
| 3rd row | CAN |
| 4th row | CAN |
| 5th row | CAN |
Common Values
| Value | Count | Frequency (%) |
| NLD | 355 | |
| FIN | 271 | |
| USA | 54 | 7.6% |
| CAN | 27 | 3.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| nld | 355 | |
| fin | 271 | |
| usa | 54 | 7.6% |
| can | 27 | 3.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 653 | |
| L | 355 | |
| D | 355 | |
| F | 271 | |
| I | 271 | |
| A | 81 | 3.8% |
| U | 54 | 2.5% |
| S | 54 | 2.5% |
| C | 27 | 1.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2121 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| N | 653 | |
| L | 355 | |
| D | 355 | |
| F | 271 | |
| I | 271 | |
| A | 81 | 3.8% |
| U | 54 | 2.5% |
| S | 54 | 2.5% |
| C | 27 | 1.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2121 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| N | 653 | |
| L | 355 | |
| D | 355 | |
| F | 271 | |
| I | 271 | |
| A | 81 | 3.8% |
| U | 54 | 2.5% |
| S | 54 | 2.5% |
| C | 27 | 1.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2121 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| N | 653 | |
| L | 355 | |
| D | 355 | |
| F | 271 | |
| I | 271 | |
| A | 81 | 3.8% |
| U | 54 | 2.5% |
| S | 54 | 2.5% |
| C | 27 | 1.3% |
non_westernized
Boolean
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 839.0 B |
| False |
|---|
| Value | Count | Frequency (%) |
| False | 707 |
sequencing_platform
Categorical
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.7 KiB |
| IlluminaHiSeq |
|---|
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | IlluminaHiSeq |
|---|---|
| 2nd row | IlluminaHiSeq |
| 3rd row | IlluminaHiSeq |
| 4th row | IlluminaHiSeq |
| 5th row | IlluminaHiSeq |
Common Values
| Value | Count | Frequency (%) |
| IlluminaHiSeq | 707 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| illuminahiseq | 707 |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 1414 | |
| i | 1414 | |
| I | 707 | |
| u | 707 | |
| m | 707 | |
| n | 707 | |
| a | 707 | |
| H | 707 | |
| S | 707 | |
| e | 707 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 9191 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| l | 1414 | |
| i | 1414 | |
| I | 707 | |
| u | 707 | |
| m | 707 | |
| n | 707 | |
| a | 707 | |
| H | 707 | |
| S | 707 | |
| e | 707 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 9191 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| l | 1414 | |
| i | 1414 | |
| I | 707 | |
| u | 707 | |
| m | 707 | |
| n | 707 | |
| a | 707 | |
| H | 707 | |
| S | 707 | |
| e | 707 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 9191 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| l | 1414 | |
| i | 1414 | |
| I | 707 | |
| u | 707 | |
| m | 707 | |
| n | 707 | |
| a | 707 | |
| H | 707 | |
| S | 707 | |
| e | 707 |
DNA_extraction_kit
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.7 KiB |
| Qiagen | |
|---|---|
| PowerSoil | |
| MoBio |
Length
| Max length | 9 |
|---|---|
| Median length | 6 |
| Mean length | 7.0353607 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MoBio |
|---|---|
| 2nd row | MoBio |
| 3rd row | MoBio |
| 4th row | MoBio |
| 5th row | MoBio |
Common Values
| Value | Count | Frequency (%) |
| Qiagen | 355 | |
| PowerSoil | 271 | |
| MoBio | 81 | 11.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| qiagen | 355 | |
| powersoil | 271 | |
| mobio | 81 | 11.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 707 | |
| o | 704 | |
| e | 626 | |
| Q | 355 | |
| a | 355 | |
| g | 355 | |
| n | 355 | |
| P | 271 | 5.4% |
| w | 271 | 5.4% |
| r | 271 | 5.4% |
| Other values (4) | 704 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4974 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 707 | |
| o | 704 | |
| e | 626 | |
| Q | 355 | |
| a | 355 | |
| g | 355 | |
| n | 355 | |
| P | 271 | 5.4% |
| w | 271 | 5.4% |
| r | 271 | 5.4% |
| Other values (4) | 704 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4974 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 707 | |
| o | 704 | |
| e | 626 | |
| Q | 355 | |
| a | 355 | |
| g | 355 | |
| n | 355 | |
| P | 271 | 5.4% |
| w | 271 | 5.4% |
| r | 271 | 5.4% |
| Other values (4) | 704 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4974 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 707 | |
| o | 704 | |
| e | 626 | |
| Q | 355 | |
| a | 355 | |
| g | 355 | |
| n | 355 | |
| P | 271 | 5.4% |
| w | 271 | 5.4% |
| r | 271 | 5.4% |
| Other values (4) | 704 |
PMID
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.7 KiB |
| 30567928 | |
|---|---|
| 30001517 | |
| 30459201 |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 30459201 |
|---|---|
| 2nd row | 30459201 |
| 3rd row | 30459201 |
| 4th row | 30459201 |
| 5th row | 30459201 |
Common Values
| Value | Count | Frequency (%) |
| 30567928 | 355 | |
| 30001517 | 271 | |
| 30459201 | 81 | 11.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 30567928 | 355 | |
| 30001517 | 271 | |
| 30459201 | 81 | 11.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1330 | |
| 3 | 707 | |
| 5 | 707 | |
| 7 | 626 | |
| 1 | 623 | |
| 9 | 436 | 7.7% |
| 2 | 436 | 7.7% |
| 6 | 355 | 6.3% |
| 8 | 355 | 6.3% |
| 4 | 81 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5656 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1330 | |
| 3 | 707 | |
| 5 | 707 | |
| 7 | 626 | |
| 1 | 623 | |
| 9 | 436 | 7.7% |
| 2 | 436 | 7.7% |
| 6 | 355 | 6.3% |
| 8 | 355 | 6.3% |
| 4 | 81 | 1.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5656 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1330 | |
| 3 | 707 | |
| 5 | 707 | |
| 7 | 626 | |
| 1 | 623 | |
| 9 | 436 | 7.7% |
| 2 | 436 | 7.7% |
| 6 | 355 | 6.3% |
| 8 | 355 | 6.3% |
| 4 | 81 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5656 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1330 | |
| 3 | 707 | |
| 5 | 707 | |
| 7 | 626 | |
| 1 | 623 | |
| 9 | 436 | 7.7% |
| 2 | 436 | 7.7% |
| 6 | 355 | 6.3% |
| 8 | 355 | 6.3% |
| 4 | 81 | 1.4% |
number_reads
Real number (ℝ)
High correlation  Unique 
| Distinct | 707 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23644491 |
| Minimum | 17146 |
|---|---|
| Maximum | 61282548 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.7 KiB |
Quantile statistics
| Minimum | 17146 |
|---|---|
| 5-th percentile | 3704368 |
| Q1 | 14631833 |
| median | 22911696 |
| Q3 | 33073995 |
| 95-th percentile | 43157820 |
| Maximum | 61282548 |
| Range | 61265402 |
| Interquartile range (IQR) | 18442162 |
Descriptive statistics
| Standard deviation | 12319299 |
|---|---|
| Coefficient of variation (CV) | 0.52102195 |
| Kurtosis | -0.54290552 |
| Mean | 23644491 |
| Median Absolute Deviation (MAD) | 9071348 |
| Skewness | 0.15906303 |
| Sum | 1.6716655 × 1010 |
| Variance | 1.5176513 × 1014 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6133350 | 1 | 0.1% |
| 24298490 | 1 | 0.1% |
| 32354982 | 1 | 0.1% |
| 40224828 | 1 | 0.1% |
| 35364602 | 1 | 0.1% |
| 26496 | 1 | 0.1% |
| 34216240 | 1 | 0.1% |
| 32741126 | 1 | 0.1% |
| 28962234 | 1 | 0.1% |
| 22129382 | 1 | 0.1% |
| Other values (697) | 697 |
| Value | Count | Frequency (%) |
| 17146 | 1 | |
| 26496 | 1 | |
| 52356 | 1 | |
| 69510 | 1 | |
| 125344 | 1 | |
| 194214 | 1 | |
| 198646 | 1 | |
| 330750 | 1 | |
| 348544 | 1 | |
| 611596 | 1 |
| Value | Count | Frequency (%) |
| 61282548 | 1 | |
| 57831136 | 1 | |
| 57570980 | 1 | |
| 54594488 | 1 | |
| 53033258 | 1 | |
| 52420562 | 1 | |
| 51078896 | 1 | |
| 50441276 | 1 | |
| 50113134 | 1 | |
| 49914668 | 1 |
number_bases
Real number (ℝ)
High correlation  Unique 
| Distinct | 707 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.3412048 × 109 |
| Minimum | 2086283 |
|---|---|
| Maximum | 6.0902587 × 109 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.7 KiB |
Quantile statistics
| Minimum | 2086283 |
|---|---|
| 5-th percentile | 4.5041067 × 108 |
| Q1 | 1.4536673 × 109 |
| median | 2.2335913 × 109 |
| Q3 | 3.247629 × 109 |
| 95-th percentile | 4.2638767 × 109 |
| Maximum | 6.0902587 × 109 |
| Range | 6.0881724 × 109 |
| Interquartile range (IQR) | 1.7939617 × 109 |
Descriptive statistics
| Standard deviation | 1.1995216 × 109 |
|---|---|
| Coefficient of variation (CV) | 0.51235226 |
| Kurtosis | -0.4914903 |
| Mean | 2.3412048 × 109 |
| Median Absolute Deviation (MAD) | 8.7806468 × 108 |
| Skewness | 0.22538048 |
| Sum | 1.6552318 × 1012 |
| Variance | 1.438852 × 1018 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 763336051 | 1 | 0.1% |
| 2399181212 | 1 | 0.1% |
| 3213885342 | 1 | 0.1% |
| 3999590449 | 1 | 0.1% |
| 3476933937 | 1 | 0.1% |
| 2443992 | 1 | 0.1% |
| 3362292000 | 1 | 0.1% |
| 3231276322 | 1 | 0.1% |
| 2855311495 | 1 | 0.1% |
| 2189611119 | 1 | 0.1% |
| Other values (697) | 697 |
| Value | Count | Frequency (%) |
| 2086283 | 1 | |
| 2443992 | 1 | |
| 6483099 | 1 | |
| 6625974 | 1 | |
| 12254835 | 1 | |
| 19193033 | 1 | |
| 19211612 | 1 | |
| 34675905 | 1 | |
| 41203287 | 1 | |
| 59380464 | 1 |
| Value | Count | Frequency (%) |
| 6090258687 | 1 | |
| 5748426209 | 1 | |
| 5712739338 | 1 | |
| 5417298173 | 1 | |
| 5281156418 | 1 | |
| 5076562618 | 1 | |
| 5074055134 | 1 | |
| 5010104788 | 1 | |
| 4992535615 | 1 | |
| 4961648630 | 1 |
minimum_read_length
Real number (ℝ)
High correlation 
| Distinct | 13 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 59.762376 |
| Minimum | 50 |
|---|---|
| Maximum | 80 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.7 KiB |
Quantile statistics
| Minimum | 50 |
|---|---|
| 5-th percentile | 50 |
| Q1 | 57 |
| median | 60 |
| Q3 | 60 |
| 95-th percentile | 75 |
| Maximum | 80 |
| Range | 30 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 7.2497025 |
|---|---|
| Coefficient of variation (CV) | 0.12130881 |
| Kurtosis | 0.43237454 |
| Mean | 59.762376 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.78733893 |
| Sum | 42252 |
| Variance | 52.558186 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 60 | 398 | |
| 50 | 85 | 12.0% |
| 75 | 81 | 11.5% |
| 51 | 77 | 10.9% |
| 57 | 22 | 3.1% |
| 64 | 15 | 2.1% |
| 63 | 10 | 1.4% |
| 52 | 7 | 1.0% |
| 78 | 5 | 0.7% |
| 71 | 3 | 0.4% |
| Other values (3) | 4 | 0.6% |
| Value | Count | Frequency (%) |
| 50 | 85 | 12.0% |
| 51 | 77 | 10.9% |
| 52 | 7 | 1.0% |
| 57 | 22 | 3.1% |
| 60 | 398 | |
| 63 | 10 | 1.4% |
| 64 | 15 | 2.1% |
| 71 | 3 | 0.4% |
| 73 | 1 | 0.1% |
| 75 | 81 | 11.5% |
| Value | Count | Frequency (%) |
| 80 | 2 | 0.3% |
| 78 | 5 | 0.7% |
| 76 | 1 | 0.1% |
| 75 | 81 | 11.5% |
| 73 | 1 | 0.1% |
| 71 | 3 | 0.4% |
| 64 | 15 | 2.1% |
| 63 | 10 | 1.4% |
| 60 | 398 | |
| 57 | 22 | 3.1% |
median_read_length
Categorical
High correlation  Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.7 KiB |
| 101 | |
|---|---|
| 126 | |
| 100 | 6 |
| 125 | 2 |
| 95 | 1 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.9985856 |
| Min length | 2 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 126 |
|---|---|
| 2nd row | 126 |
| 3rd row | 126 |
| 4th row | 126 |
| 5th row | 126 |
Common Values
| Value | Count | Frequency (%) |
| 101 | 619 | |
| 126 | 79 | 11.2% |
| 100 | 6 | 0.8% |
| 125 | 2 | 0.3% |
| 95 | 1 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 101 | 619 | |
| 126 | 79 | 11.2% |
| 100 | 6 | 0.8% |
| 125 | 2 | 0.3% |
| 95 | 1 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1325 | |
| 0 | 631 | |
| 2 | 81 | 3.8% |
| 6 | 79 | 3.7% |
| 5 | 3 | 0.1% |
| 9 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2120 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 1325 | |
| 0 | 631 | |
| 2 | 81 | 3.8% |
| 6 | 79 | 3.7% |
| 5 | 3 | 0.1% |
| 9 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2120 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 1325 | |
| 0 | 631 | |
| 2 | 81 | 3.8% |
| 6 | 79 | 3.7% |
| 5 | 3 | 0.1% |
| 9 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2120 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 1325 | |
| 0 | 631 | |
| 2 | 81 | 3.8% |
| 6 | 79 | 3.7% |
| 5 | 3 | 0.1% |
| 9 | 1 | < 0.1% |
NCBI_accession
Text
Missing 
| Distinct | 352 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 355 |
| Missing (%) | 50.2% |
| Memory size | 5.7 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 352 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | SRR5665080 |
|---|---|
| 2nd row | SRR5665075 |
| 3rd row | SRR5665074 |
| 4th row | SRR5665073 |
| 5th row | SRR5665072 |
| Value | Count | Frequency (%) |
| srr5665080 | 1 | 0.3% |
| srr7280919 | 1 | 0.3% |
| srr5665074 | 1 | 0.3% |
| srr5665073 | 1 | 0.3% |
| srr5665072 | 1 | 0.3% |
| srr5665079 | 1 | 0.3% |
| srr5665078 | 1 | 0.3% |
| srr5665077 | 1 | 0.3% |
| srr5665076 | 1 | 0.3% |
| srr5665134 | 1 | 0.3% |
| Other values (342) | 342 |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 704 | |
| 8 | 427 | |
| 0 | 374 | |
| S | 352 | |
| 7 | 351 | |
| 2 | 344 | |
| 5 | 244 | 6.9% |
| 6 | 233 | 6.6% |
| 1 | 180 | 5.1% |
| 9 | 158 | 4.5% |
| Other values (2) | 153 | 4.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3520 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| R | 704 | |
| 8 | 427 | |
| 0 | 374 | |
| S | 352 | |
| 7 | 351 | |
| 2 | 344 | |
| 5 | 244 | 6.9% |
| 6 | 233 | 6.6% |
| 1 | 180 | 5.1% |
| 9 | 158 | 4.5% |
| Other values (2) | 153 | 4.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3520 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| R | 704 | |
| 8 | 427 | |
| 0 | 374 | |
| S | 352 | |
| 7 | 351 | |
| 2 | 344 | |
| 5 | 244 | 6.9% |
| 6 | 233 | 6.6% |
| 1 | 180 | 5.1% |
| 9 | 158 | 4.5% |
| Other values (2) | 153 | 4.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3520 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| R | 704 | |
| 8 | 427 | |
| 0 | 374 | |
| S | 352 | |
| 7 | 351 | |
| 2 | 344 | |
| 5 | 244 | 6.9% |
| 6 | 233 | 6.6% |
| 1 | 180 | 5.1% |
| 9 | 158 | 4.5% |
| Other values (2) | 153 | 4.3% |
pregnant
Boolean
High correlation  Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 436 |
| Missing (%) | 61.7% |
| Memory size | 1.5 KiB |
| False | |
|---|---|
| True | 42 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 229 | |
| True | 42 | 5.9% |
| (Missing) | 436 |
lactating
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
curator
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.7 KiB |
| Ilya_Likhotkin;Paolo_Manghi | |
|---|---|
| Marisa_Metzger | |
| Paolo_Manghi |
Length
| Max length | 27 |
|---|---|
| Median length | 27 |
| Mean length | 20.298444 |
| Min length | 12 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Paolo_Manghi |
|---|---|
| 2nd row | Paolo_Manghi |
| 3rd row | Paolo_Manghi |
| 4th row | Paolo_Manghi |
| 5th row | Paolo_Manghi |
Common Values
| Value | Count | Frequency (%) |
| Ilya_Likhotkin;Paolo_Manghi | 355 | |
| Marisa_Metzger | 271 | |
| Paolo_Manghi | 81 | 11.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| ilya_likhotkin;paolo_manghi | 355 | |
| marisa_metzger | 271 | |
| paolo_manghi | 81 | 11.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1769 | |
| i | 1417 | 9.9% |
| o | 1227 | 8.5% |
| _ | 1062 | 7.4% |
| M | 978 | 6.8% |
| h | 791 | 5.5% |
| l | 791 | 5.5% |
| n | 791 | 5.5% |
| k | 710 | 4.9% |
| g | 707 | 4.9% |
| Other values (10) | 4108 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 14351 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 1769 | |
| i | 1417 | 9.9% |
| o | 1227 | 8.5% |
| _ | 1062 | 7.4% |
| M | 978 | 6.8% |
| h | 791 | 5.5% |
| l | 791 | 5.5% |
| n | 791 | 5.5% |
| k | 710 | 4.9% |
| g | 707 | 4.9% |
| Other values (10) | 4108 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 14351 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 1769 | |
| i | 1417 | 9.9% |
| o | 1227 | 8.5% |
| _ | 1062 | 7.4% |
| M | 978 | 6.8% |
| h | 791 | 5.5% |
| l | 791 | 5.5% |
| n | 791 | 5.5% |
| k | 710 | 4.9% |
| g | 707 | 4.9% |
| Other values (10) | 4108 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 14351 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 1769 | |
| i | 1417 | 9.9% |
| o | 1227 | 8.5% |
| _ | 1062 | 7.4% |
| M | 978 | 6.8% |
| h | 791 | 5.5% |
| l | 791 | 5.5% |
| n | 791 | 5.5% |
| k | 710 | 4.9% |
| g | 707 | 4.9% |
| Other values (10) | 4108 |
BMI
Real number (ℝ)
High correlation  Missing 
| Distinct | 75 |
|---|---|
| Distinct (%) | 93.8% |
| Missing | 627 |
| Missing (%) | 88.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28.05552 |
| Minimum | 18.662015 |
|---|---|
| Maximum | 57.463494 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.7 KiB |
Quantile statistics
| Minimum | 18.662015 |
|---|---|
| 5-th percentile | 21.09295 |
| Q1 | 23.863606 |
| median | 27.019632 |
| Q3 | 31.606655 |
| 95-th percentile | 35.89538 |
| Maximum | 57.463494 |
| Range | 38.801479 |
| Interquartile range (IQR) | 7.7430497 |
Descriptive statistics
| Standard deviation | 6.1195984 |
|---|---|
| Coefficient of variation (CV) | 0.21812457 |
| Kurtosis | 6.1087119 |
| Mean | 28.05552 |
| Median Absolute Deviation (MAD) | 3.4462189 |
| Skewness | 1.7809348 |
| Sum | 2244.4416 |
| Variance | 37.449485 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 21.33821064 | 3 | 0.4% |
| 35.49786654 | 2 | 0.3% |
| 21.1552942 | 2 | 0.3% |
| 27.82931354 | 2 | 0.3% |
| 21.51385851 | 1 | 0.1% |
| 46.36734694 | 1 | 0.1% |
| 29.75206612 | 1 | 0.1% |
| 21.62064772 | 1 | 0.1% |
| 23.082542 | 1 | 0.1% |
| 34.47772096 | 1 | 0.1% |
| Other values (65) | 65 | 9.2% |
| (Missing) | 627 |
| Value | Count | Frequency (%) |
| 18.66201469 | 1 | 0.1% |
| 20.56932966 | 1 | 0.1% |
| 20.74755019 | 1 | 0.1% |
| 20.82093992 | 1 | 0.1% |
| 21.10726644 | 1 | 0.1% |
| 21.1552942 | 2 | |
| 21.33821064 | 3 | |
| 21.51385851 | 1 | 0.1% |
| 21.60493827 | 1 | 0.1% |
| 21.62064772 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 57.46349378 | 1 | |
| 46.36734694 | 1 | |
| 39.50617284 | 1 | |
| 37.78272346 | 1 | |
| 35.79604579 | 1 | |
| 35.49786654 | 2 | |
| 35.11123879 | 1 | |
| 34.90026578 | 1 | |
| 34.47772096 | 1 | |
| 33.56401384 | 1 |
family
Text
Missing 
| Distinct | 80 |
|---|---|
| Distinct (%) | 29.5% |
| Missing | 436 |
| Missing (%) | 61.7% |
| Memory size | 5.7 KiB |
Length
| Max length | 20 |
|---|---|
| Median length | 20 |
| Mean length | 20 |
| Min length | 20 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 1.5% |
Sample
| 1st row | YassourM_2018_M0018C |
|---|---|
| 2nd row | YassourM_2018_M0024M |
| 3rd row | YassourM_2018_M0402C |
| 4th row | YassourM_2018_M0402M |
| 5th row | YassourM_2018_M0402M |
| Value | Count | Frequency (%) |
| yassourm_2018_m0059c | 5 | 1.8% |
| yassourm_2018_m0259c | 5 | 1.8% |
| yassourm_2018_m1098c | 5 | 1.8% |
| yassourm_2018_m0297c | 5 | 1.8% |
| yassourm_2018_m0487c | 5 | 1.8% |
| yassourm_2018_m0261c | 5 | 1.8% |
| yassourm_2018_m0450c | 5 | 1.8% |
| yassourm_2018_m0038c | 5 | 1.8% |
| yassourm_2018_m0084c | 5 | 1.8% |
| yassourm_2018_m0399c | 5 | 1.8% |
| Other values (70) | 221 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 658 | |
| M | 658 | |
| s | 542 | |
| _ | 542 | |
| 2 | 374 | 6.9% |
| 8 | 352 | 6.5% |
| 1 | 340 | 6.3% |
| a | 271 | 5.0% |
| Y | 271 | 5.0% |
| r | 271 | 5.0% |
| Other values (9) | 1141 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5420 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 658 | |
| M | 658 | |
| s | 542 | |
| _ | 542 | |
| 2 | 374 | 6.9% |
| 8 | 352 | 6.5% |
| 1 | 340 | 6.3% |
| a | 271 | 5.0% |
| Y | 271 | 5.0% |
| r | 271 | 5.0% |
| Other values (9) | 1141 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5420 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 658 | |
| M | 658 | |
| s | 542 | |
| _ | 542 | |
| 2 | 374 | 6.9% |
| 8 | 352 | 6.5% |
| 1 | 340 | 6.3% |
| a | 271 | 5.0% |
| Y | 271 | 5.0% |
| r | 271 | 5.0% |
| Other values (9) | 1141 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5420 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 658 | |
| M | 658 | |
| s | 542 | |
| _ | 542 | |
| 2 | 374 | 6.9% |
| 8 | 352 | 6.5% |
| 1 | 340 | 6.3% |
| a | 271 | 5.0% |
| Y | 271 | 5.0% |
| r | 271 | 5.0% |
| Other values (9) | 1141 |
treatment
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
days_from_first_collection
Real number (ℝ)
High correlation  Missing  Zeros 
| Distinct | 7 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 436 |
| Missing (%) | 61.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 54.118081 |
| Minimum | 0 |
|---|---|
| Maximum | 167 |
| Zeros | 69 |
| Zeros (%) | 9.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 60 |
| Q3 | 77 |
| 95-th percentile | 167 |
| Maximum | 167 |
| Range | 167 |
| Interquartile range (IQR) | 77 |
Descriptive statistics
| Standard deviation | 52.025002 |
|---|---|
| Coefficient of variation (CV) | 0.96132384 |
| Kurtosis | -0.044597768 |
| Mean | 54.118081 |
| Median Absolute Deviation (MAD) | 30 |
| Skewness | 0.87688467 |
| Sum | 14666 |
| Variance | 2706.6008 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 69 | 9.8% |
| 77 | 43 | 6.1% |
| 60 | 33 | 4.7% |
| 14 | 32 | 4.5% |
| 30 | 32 | 4.5% |
| 90 | 31 | 4.4% |
| 167 | 31 | 4.4% |
| (Missing) | 436 |
| Value | Count | Frequency (%) |
| 0 | 69 | |
| 14 | 32 | |
| 30 | 32 | |
| 60 | 33 | |
| 77 | 43 | |
| 90 | 31 | |
| 167 | 31 |
| Value | Count | Frequency (%) |
| 167 | 31 | |
| 90 | 31 | |
| 77 | 43 | |
| 60 | 33 | |
| 30 | 32 | |
| 14 | 32 | |
| 0 | 69 |
family_role
Categorical
High correlation  Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 436 |
| Missing (%) | 61.7% |
| Memory size | 5.7 KiB |
| child | |
|---|---|
| mother |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 5.4280443 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | child |
|---|---|
| 2nd row | mother |
| 3rd row | child |
| 4th row | mother |
| 5th row | mother |
Common Values
| Value | Count | Frequency (%) |
| child | 155 | 21.9% |
| mother | 116 | 16.4% |
| (Missing) | 436 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| child | 155 | |
| mother | 116 |
Most occurring characters
| Value | Count | Frequency (%) |
| h | 271 | |
| c | 155 | |
| i | 155 | |
| l | 155 | |
| d | 155 | |
| m | 116 | |
| o | 116 | |
| t | 116 | |
| e | 116 | |
| r | 116 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1471 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| h | 271 | |
| c | 155 | |
| i | 155 | |
| l | 155 | |
| d | 155 | |
| m | 116 | |
| o | 116 | |
| t | 116 | |
| e | 116 | |
| r | 116 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1471 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| h | 271 | |
| c | 155 | |
| i | 155 | |
| l | 155 | |
| d | 155 | |
| m | 116 | |
| o | 116 | |
| t | 116 | |
| e | 116 | |
| r | 116 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1471 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| h | 271 | |
| c | 155 | |
| i | 155 | |
| l | 155 | |
| d | 155 | |
| m | 116 | |
| o | 116 | |
| t | 116 | |
| e | 116 | |
| r | 116 |
born_method
Categorical
High correlation  Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 552 |
| Missing (%) | 78.1% |
| Memory size | 5.7 KiB |
| vaginal | |
|---|---|
| c_section |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 7.283871 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | vaginal |
|---|---|
| 2nd row | vaginal |
| 3rd row | vaginal |
| 4th row | vaginal |
| 5th row | vaginal |
Common Values
| Value | Count | Frequency (%) |
| vaginal | 133 | 18.8% |
| c_section | 22 | 3.1% |
| (Missing) | 552 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| vaginal | 133 | |
| c_section | 22 | 14.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 266 | |
| i | 155 | |
| n | 155 | |
| v | 133 | |
| g | 133 | |
| l | 133 | |
| c | 44 | 3.9% |
| _ | 22 | 1.9% |
| s | 22 | 1.9% |
| e | 22 | 1.9% |
| Other values (2) | 44 | 3.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1129 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 266 | |
| i | 155 | |
| n | 155 | |
| v | 133 | |
| g | 133 | |
| l | 133 | |
| c | 44 | 3.9% |
| _ | 22 | 1.9% |
| s | 22 | 1.9% |
| e | 22 | 1.9% |
| Other values (2) | 44 | 3.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1129 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 266 | |
| i | 155 | |
| n | 155 | |
| v | 133 | |
| g | 133 | |
| l | 133 | |
| c | 44 | 3.9% |
| _ | 22 | 1.9% |
| s | 22 | 1.9% |
| e | 22 | 1.9% |
| Other values (2) | 44 | 3.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1129 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 266 | |
| i | 155 | |
| n | 155 | |
| v | 133 | |
| g | 133 | |
| l | 133 | |
| c | 44 | 3.9% |
| _ | 22 | 1.9% |
| s | 22 | 1.9% |
| e | 22 | 1.9% |
| Other values (2) | 44 | 3.9% |
feeding_practice
Categorical
High correlation  Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 556 |
| Missing (%) | 78.6% |
| Memory size | 5.7 KiB |
| exclusively_breastfeeding | |
|---|---|
| mixed_feeding |
Length
| Max length | 25 |
|---|---|
| Median length | 25 |
| Mean length | 19.357616 |
| Min length | 13 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | exclusively_breastfeeding |
|---|---|
| 2nd row | mixed_feeding |
| 3rd row | mixed_feeding |
| 4th row | mixed_feeding |
| 5th row | mixed_feeding |
Common Values
| Value | Count | Frequency (%) |
| exclusively_breastfeeding | 80 | 11.3% |
| mixed_feeding | 71 | 10.0% |
| (Missing) | 556 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| exclusively_breastfeeding | 80 | |
| mixed_feeding | 71 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 613 | |
| i | 302 | |
| d | 222 | 7.6% |
| l | 160 | 5.5% |
| s | 160 | 5.5% |
| x | 151 | 5.2% |
| g | 151 | 5.2% |
| n | 151 | 5.2% |
| f | 151 | 5.2% |
| _ | 151 | 5.2% |
| Other values (9) | 711 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2923 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 613 | |
| i | 302 | |
| d | 222 | 7.6% |
| l | 160 | 5.5% |
| s | 160 | 5.5% |
| x | 151 | 5.2% |
| g | 151 | 5.2% |
| n | 151 | 5.2% |
| f | 151 | 5.2% |
| _ | 151 | 5.2% |
| Other values (9) | 711 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2923 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 613 | |
| i | 302 | |
| d | 222 | 7.6% |
| l | 160 | 5.5% |
| s | 160 | 5.5% |
| x | 151 | 5.2% |
| g | 151 | 5.2% |
| n | 151 | 5.2% |
| f | 151 | 5.2% |
| _ | 151 | 5.2% |
| Other values (9) | 711 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2923 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 613 | |
| i | 302 | |
| d | 222 | 7.6% |
| l | 160 | 5.5% |
| s | 160 | 5.5% |
| x | 151 | 5.2% |
| g | 151 | 5.2% |
| n | 151 | 5.2% |
| f | 151 | 5.2% |
| _ | 151 | 5.2% |
| Other values (9) | 711 |
location
Categorical
High correlation  Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 4.9% |
| Missing | 626 |
| Missing (%) | 88.5% |
| Memory size | 5.7 KiB |
| Toronto | |
|---|---|
| Houston | |
| AnnArbor | |
| Boston |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.037037 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Toronto |
|---|---|
| 2nd row | Toronto |
| 3rd row | Toronto |
| 4th row | Toronto |
| 5th row | Toronto |
Common Values
| Value | Count | Frequency (%) |
| Toronto | 27 | 3.8% |
| Houston | 27 | 3.8% |
| AnnArbor | 15 | 2.1% |
| Boston | 12 | 1.7% |
| (Missing) | 626 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| toronto | 27 | |
| houston | 27 | |
| annarbor | 15 | |
| boston | 12 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 174 | |
| n | 96 | |
| t | 66 | 11.6% |
| r | 57 | 10.0% |
| s | 39 | 6.8% |
| A | 30 | 5.3% |
| T | 27 | 4.7% |
| H | 27 | 4.7% |
| u | 27 | 4.7% |
| b | 15 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 570 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 174 | |
| n | 96 | |
| t | 66 | 11.6% |
| r | 57 | 10.0% |
| s | 39 | 6.8% |
| A | 30 | 5.3% |
| T | 27 | 4.7% |
| H | 27 | 4.7% |
| u | 27 | 4.7% |
| b | 15 | 2.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 570 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 174 | |
| n | 96 | |
| t | 66 | 11.6% |
| r | 57 | 10.0% |
| s | 39 | 6.8% |
| A | 30 | 5.3% |
| T | 27 | 4.7% |
| H | 27 | 4.7% |
| u | 27 | 4.7% |
| b | 15 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 570 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 174 | |
| n | 96 | |
| t | 66 | 11.6% |
| r | 57 | 10.0% |
| s | 39 | 6.8% |
| A | 30 | 5.3% |
| T | 27 | 4.7% |
| H | 27 | 4.7% |
| u | 27 | 4.7% |
| b | 15 | 2.6% |
diet
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
travel_destination
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
visit_number
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
premature
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
birth_weight
Real number (ℝ)
High correlation  Missing 
| Distinct | 35 |
|---|---|
| Distinct (%) | 22.6% |
| Missing | 552 |
| Missing (%) | 78.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3474.1935 |
| Minimum | 2650 |
|---|---|
| Maximum | 4330 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.7 KiB |
Quantile statistics
| Minimum | 2650 |
|---|---|
| 5-th percentile | 2913 |
| Q1 | 3205 |
| median | 3505 |
| Q3 | 3660 |
| 95-th percentile | 4005 |
| Maximum | 4330 |
| Range | 1680 |
| Interquartile range (IQR) | 455 |
Descriptive statistics
| Standard deviation | 355.53667 |
|---|---|
| Coefficient of variation (CV) | 0.10233646 |
| Kurtosis | 0.10306626 |
| Mean | 3474.1935 |
| Median Absolute Deviation (MAD) | 235 |
| Skewness | 0.0039501813 |
| Sum | 538500 |
| Variance | 126406.33 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3090 | 10 | 1.4% |
| 4005 | 9 | 1.3% |
| 3270 | 5 | 0.7% |
| 3385 | 5 | 0.7% |
| 3685 | 5 | 0.7% |
| 3570 | 5 | 0.7% |
| 3175 | 5 | 0.7% |
| 3210 | 5 | 0.7% |
| 3205 | 5 | 0.7% |
| 3640 | 5 | 0.7% |
| Other values (25) | 96 | 13.6% |
| (Missing) | 552 |
| Value | Count | Frequency (%) |
| 2650 | 4 | 0.6% |
| 2780 | 4 | 0.6% |
| 2970 | 5 | |
| 3090 | 10 | |
| 3095 | 4 | 0.6% |
| 3120 | 1 | 0.1% |
| 3145 | 1 | 0.1% |
| 3150 | 4 | 0.6% |
| 3175 | 5 | |
| 3205 | 5 |
| Value | Count | Frequency (%) |
| 4330 | 1 | 0.1% |
| 4310 | 5 | |
| 4005 | 9 | |
| 3900 | 1 | 0.1% |
| 3870 | 4 | |
| 3775 | 4 | |
| 3740 | 5 | |
| 3685 | 5 | |
| 3670 | 5 | |
| 3650 | 5 |
gestational_age
Real number (ℝ)
High correlation  Missing 
| Distinct | 21 |
|---|---|
| Distinct (%) | 13.5% |
| Missing | 552 |
| Missing (%) | 78.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 39.981935 |
| Minimum | 36.6 |
|---|---|
| Maximum | 42.4 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.7 KiB |
Quantile statistics
| Minimum | 36.6 |
|---|---|
| 5-th percentile | 38 |
| Q1 | 39 |
| median | 40.1 |
| Q3 | 40.7 |
| 95-th percentile | 42.1 |
| Maximum | 42.4 |
| Range | 5.8 |
| Interquartile range (IQR) | 1.7 |
Descriptive statistics
| Standard deviation | 1.2399451 |
|---|---|
| Coefficient of variation (CV) | 0.031012632 |
| Kurtosis | 0.084003382 |
| Mean | 39.981935 |
| Median Absolute Deviation (MAD) | 0.8 |
| Skewness | -0.37036001 |
| Sum | 6197.2 |
| Variance | 1.5374638 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 40.7 | 25 | 3.5% |
| 39 | 13 | 1.8% |
| 40.3 | 12 | 1.7% |
| 40.9 | 10 | 1.4% |
| 38 | 10 | 1.4% |
| 40.1 | 9 | 1.3% |
| 40.6 | 9 | 1.3% |
| 40 | 6 | 0.8% |
| 41.4 | 6 | 0.8% |
| 41.7 | 5 | 0.7% |
| Other values (11) | 50 | 7.1% |
| (Missing) | 552 |
| Value | Count | Frequency (%) |
| 36.6 | 4 | 0.6% |
| 38 | 10 | |
| 38.4 | 5 | 0.7% |
| 38.7 | 5 | 0.7% |
| 38.9 | 5 | 0.7% |
| 39 | 13 | |
| 39.1 | 5 | 0.7% |
| 39.3 | 5 | 0.7% |
| 39.4 | 4 | 0.6% |
| 39.6 | 3 | 0.4% |
| Value | Count | Frequency (%) |
| 42.4 | 5 | 0.7% |
| 42.1 | 5 | 0.7% |
| 41.7 | 5 | 0.7% |
| 41.4 | 6 | 0.8% |
| 40.9 | 10 | 1.4% |
| 40.7 | 25 | |
| 40.6 | 9 | 1.3% |
| 40.3 | 12 | |
| 40.1 | 9 | 1.3% |
| 40 | 6 | 0.8% |
antibiotics_family
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
disease_subtype
Categorical
High correlation  Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 352 |
| Missing (%) | 49.8% |
| Memory size | 5.7 KiB |
| CD | |
|---|---|
| UC | |
| undetermined_colitis | 20 |
Length
| Max length | 20 |
|---|---|
| Median length | 2 |
| Mean length | 3.0140845 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | UC |
|---|---|
| 2nd row | UC |
| 3rd row | UC |
| 4th row | CD |
| 5th row | CD |
Common Values
| Value | Count | Frequency (%) |
| CD | 216 | |
| UC | 119 | 16.8% |
| undetermined_colitis | 20 | 2.8% |
| (Missing) | 352 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| cd | 216 | |
| uc | 119 | |
| undetermined_colitis | 20 | 5.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 335 | |
| D | 216 | |
| U | 119 | 11.1% |
| e | 60 | 5.6% |
| i | 60 | 5.6% |
| n | 40 | 3.7% |
| d | 40 | 3.7% |
| t | 40 | 3.7% |
| u | 20 | 1.9% |
| r | 20 | 1.9% |
| Other values (6) | 120 | 11.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1070 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| C | 335 | |
| D | 216 | |
| U | 119 | 11.1% |
| e | 60 | 5.6% |
| i | 60 | 5.6% |
| n | 40 | 3.7% |
| d | 40 | 3.7% |
| t | 40 | 3.7% |
| u | 20 | 1.9% |
| r | 20 | 1.9% |
| Other values (6) | 120 | 11.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1070 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| C | 335 | |
| D | 216 | |
| U | 119 | 11.1% |
| e | 60 | 5.6% |
| i | 60 | 5.6% |
| n | 40 | 3.7% |
| d | 40 | 3.7% |
| t | 40 | 3.7% |
| u | 20 | 1.9% |
| r | 20 | 1.9% |
| Other values (6) | 120 | 11.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1070 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| C | 335 | |
| D | 216 | |
| U | 119 | 11.1% |
| e | 60 | 5.6% |
| i | 60 | 5.6% |
| n | 40 | 3.7% |
| d | 40 | 3.7% |
| t | 40 | 3.7% |
| u | 20 | 1.9% |
| r | 20 | 1.9% |
| Other values (6) | 120 | 11.2% |
days_after_onset
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
creatine
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
albumine
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
hscrp
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
ESR
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
ast
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
alt
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
globulin
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
urea_nitrogen
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
BASDAI
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
BASFI
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
alcohol
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
flg_genotype
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
population
Categorical
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 352 |
| Missing (%) | 49.8% |
| Memory size | 5.7 KiB |
| Dutch |
|---|
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Dutch |
|---|---|
| 2nd row | Dutch |
| 3rd row | Dutch |
| 4th row | Dutch |
| 5th row | Dutch |
Common Values
| Value | Count | Frequency (%) |
| Dutch | 355 | |
| (Missing) | 352 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| dutch | 355 |
Most occurring characters
| Value | Count | Frequency (%) |
| D | 355 | |
| u | 355 | |
| t | 355 | |
| c | 355 | |
| h | 355 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1775 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| D | 355 | |
| u | 355 | |
| t | 355 | |
| c | 355 | |
| h | 355 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1775 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| D | 355 | |
| u | 355 | |
| t | 355 | |
| c | 355 | |
| h | 355 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1775 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| D | 355 | |
| u | 355 | |
| t | 355 | |
| c | 355 | |
| h | 355 |
menopausal_status
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
lifestyle
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
body_subsite
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
uncurated_metadata
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
tnm
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
triglycerides
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
hdl
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
ldl
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
hba1c
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
change_in_tumor_size
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
RECIST
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
ORR
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
smoker
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
ever_smoker
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
dental_sample_type
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
history_of_periodontitis
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
PPD_M
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
PPD_B
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
PPD_D
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
PPD_L
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
fobt
Boolean
High correlation  Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 626 |
| Missing (%) | 88.5% |
| Memory size | 1.5 KiB |
| False | |
|---|---|
| True | 14 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 67 | 9.5% |
| True | 14 | 2.0% |
| (Missing) | 626 |
disease_stage
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
disease_location
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
calprotectin
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
HBI
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
SCCAI
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
mumps
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
cholesterol
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
c_peptide
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
glucose
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
creatinine
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
bilubirin
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
prothrombin_time
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
wbc
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
rbc
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
hemoglobinometry
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
FMT_role
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
subcohort
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
fmt_id
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
remission
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
dyastolic_p
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
systolic_p
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
insulin_cat
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
adiponectin
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
glp_1
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
cd163
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
il_1
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
leptin
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
fgf_19
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
glutamate_decarboxylase_2_antibody
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
HLA
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
autoantibody_positive
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
age_seroconversion
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
age_T1D_diagnosis
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
hitchip_probe_class
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
previous_therapy
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
performance_status
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
toxicity_above_zero
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
PFS12
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
fasting_insulin
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
fasting_glucose
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
protein_intake
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
stec_count
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
shigatoxin_2_elisa
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
stool_texture
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
anti_PD_1
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
ajcc
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
smoke
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
bristol_score
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
hsCRP
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
LDL
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
mgs_richness
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
ferm_milk_prod_consumer
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
inr
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
birth_control_pil
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
c_section_type
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
hla_drb12
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
hla_dqa12
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
hla_dqa11
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
hla_drb11
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
zigosity
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
brinkman_index
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
alcohol_numeric
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
breastfeeding_duration
Real number (ℝ)
High correlation  Missing 
| Distinct | 29 |
|---|---|
| Distinct (%) | 21.3% |
| Missing | 571 |
| Missing (%) | 80.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 357.47794 |
| Minimum | 108 |
|---|---|
| Maximum | 735 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.7 KiB |
Quantile statistics
| Minimum | 108 |
|---|---|
| 5-th percentile | 110 |
| Q1 | 276.25 |
| median | 365 |
| Q3 | 411 |
| 95-th percentile | 699 |
| Maximum | 735 |
| Range | 627 |
| Interquartile range (IQR) | 134.75 |
Descriptive statistics
| Standard deviation | 138.14951 |
|---|---|
| Coefficient of variation (CV) | 0.38645605 |
| Kurtosis | 1.1680694 |
| Mean | 357.47794 |
| Median Absolute Deviation (MAD) | 69 |
| Skewness | 0.68877722 |
| Sum | 48617 |
| Variance | 19085.288 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 365 | 10 | 1.4% |
| 326 | 5 | 0.7% |
| 488 | 5 | 0.7% |
| 699 | 5 | 0.7% |
| 331 | 5 | 0.7% |
| 408 | 5 | 0.7% |
| 385 | 5 | 0.7% |
| 376 | 5 | 0.7% |
| 305 | 5 | 0.7% |
| 246 | 5 | 0.7% |
| Other values (19) | 81 | 11.5% |
| (Missing) | 571 |
| Value | Count | Frequency (%) |
| 108 | 4 | |
| 110 | 4 | |
| 147 | 4 | |
| 182 | 4 | |
| 217 | 4 | |
| 246 | 5 | |
| 265 | 4 | |
| 268 | 5 | |
| 279 | 5 | |
| 296 | 5 |
| Value | Count | Frequency (%) |
| 735 | 4 | |
| 699 | 5 | |
| 488 | 5 | |
| 487 | 5 | |
| 486 | 4 | |
| 456 | 5 | |
| 419 | 4 | |
| 411 | 4 | |
| 409 | 4 | |
| 408 | 5 |
formula_first_day
Real number (ℝ)
High correlation  Missing 
| Distinct | 22 |
|---|---|
| Distinct (%) | 14.5% |
| Missing | 555 |
| Missing (%) | 78.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 101.14474 |
| Minimum | 1 |
|---|---|
| Maximum | 252 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 101 |
| Q3 | 184 |
| 95-th percentile | 227 |
| Maximum | 252 |
| Range | 251 |
| Interquartile range (IQR) | 181 |
Descriptive statistics
| Standard deviation | 90.324683 |
|---|---|
| Coefficient of variation (CV) | 0.89302406 |
| Kurtosis | -1.7662388 |
| Mean | 101.14474 |
| Median Absolute Deviation (MAD) | 94 |
| Skewness | 0.017678414 |
| Sum | 15374 |
| Variance | 8158.5484 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 26 | 3.7% |
| 3 | 22 | 3.1% |
| 184 | 14 | 2.0% |
| 183 | 9 | 1.3% |
| 199 | 5 | 0.7% |
| 172 | 5 | 0.7% |
| 16 | 5 | 0.7% |
| 194 | 5 | 0.7% |
| 175 | 5 | 0.7% |
| 186 | 5 | 0.7% |
| Other values (12) | 51 | 7.2% |
| (Missing) | 555 |
| Value | Count | Frequency (%) |
| 1 | 5 | 0.7% |
| 2 | 26 | |
| 3 | 22 | |
| 4 | 4 | 0.6% |
| 6 | 1 | 0.1% |
| 16 | 5 | 0.7% |
| 47 | 5 | 0.7% |
| 69 | 4 | 0.6% |
| 101 | 5 | 0.7% |
| 139 | 3 | 0.4% |
| Value | Count | Frequency (%) |
| 252 | 5 | 0.7% |
| 227 | 4 | 0.6% |
| 199 | 5 | 0.7% |
| 195 | 5 | 0.7% |
| 194 | 5 | 0.7% |
| 186 | 5 | 0.7% |
| 184 | 14 | |
| 183 | 9 | |
| 182 | 5 | 0.7% |
| 175 | 5 | 0.7% |
ALT
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
eGFR
Unsupported
Missing  Rejected  Unsupported 
| Missing | 707 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 5.7 KiB |
Interactions
Correlations
| BMI | DNA_extraction_kit | PMID | age | age_category | birth_weight | born_method | breastfeeding_duration | country | curator | days_from_first_collection | disease | disease_subtype | family_role | feeding_practice | fobt | formula_first_day | gender | gestational_age | infant_age | location | median_read_length | minimum_read_length | number_bases | number_reads | pregnant | study_condition | study_name | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| BMI | 1.000 | 1.000 | 1.000 | -0.181 | 0.000 | NaN | 0.000 | NaN | 0.000 | 1.000 | NaN | 0.083 | 0.000 | 0.000 | 0.000 | 0.275 | NaN | 0.277 | NaN | 0.000 | 0.159 | 0.000 | NaN | -0.145 | -0.145 | 0.000 | 0.083 | 1.000 |
| DNA_extraction_kit | 1.000 | 1.000 | 1.000 | 1.000 | 0.585 | 1.000 | 1.000 | 1.000 | 0.999 | 1.000 | 1.000 | 0.891 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.192 | 1.000 | 1.000 | 1.000 | 0.709 | 0.927 | 0.598 | 0.636 | 1.000 | 0.891 | 1.000 |
| PMID | 1.000 | 1.000 | 1.000 | 1.000 | 0.585 | 1.000 | 1.000 | 1.000 | 0.999 | 1.000 | 1.000 | 0.891 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.192 | 1.000 | 1.000 | 1.000 | 0.709 | 0.927 | 0.598 | 0.636 | 1.000 | 0.891 | 1.000 |
| age | -0.181 | 1.000 | 1.000 | 1.000 | 0.868 | NaN | 0.000 | NaN | 0.398 | 1.000 | NaN | 0.203 | 0.000 | 0.000 | 0.000 | 0.069 | NaN | 0.000 | NaN | 0.000 | 0.155 | 0.000 | NaN | 0.006 | 0.005 | 0.000 | 0.203 | 1.000 |
| age_category | 0.000 | 0.585 | 0.585 | 0.868 | 1.000 | 1.000 | 1.000 | 1.000 | 0.619 | 0.585 | 0.784 | 0.553 | 1.000 | 0.992 | 1.000 | 0.000 | 1.000 | 0.158 | 1.000 | 1.000 | 0.286 | 0.372 | 0.525 | 0.323 | 0.321 | 0.482 | 0.553 | 0.585 |
| birth_weight | NaN | 1.000 | 1.000 | NaN | 1.000 | 1.000 | 0.353 | -0.176 | 1.000 | 1.000 | 0.009 | 1.000 | 0.000 | 1.000 | 0.556 | 0.000 | 0.128 | 0.599 | 0.390 | 0.000 | 0.000 | 0.282 | -0.056 | 0.015 | 0.012 | 1.000 | 1.000 | 1.000 |
| born_method | 0.000 | 1.000 | 1.000 | 0.000 | 1.000 | 0.353 | 1.000 | 0.483 | 1.000 | 1.000 | 0.000 | 1.000 | 0.000 | 1.000 | 0.200 | 0.000 | 0.343 | 0.112 | 0.470 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 | 1.000 |
| breastfeeding_duration | NaN | 1.000 | 1.000 | NaN | 1.000 | -0.176 | 0.483 | 1.000 | 1.000 | 1.000 | -0.030 | 1.000 | 0.000 | 1.000 | 0.452 | 0.000 | 0.406 | 0.448 | -0.049 | 0.000 | 0.000 | 0.000 | -0.089 | 0.079 | 0.078 | 1.000 | 1.000 | 1.000 |
| country | 0.000 | 0.999 | 0.999 | 0.398 | 0.619 | 1.000 | 1.000 | 1.000 | 1.000 | 0.999 | 1.000 | 0.832 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000 | 0.188 | 1.000 | 1.000 | 0.987 | 0.579 | 0.755 | 0.495 | 0.518 | 1.000 | 0.832 | 0.999 |
| curator | 1.000 | 1.000 | 1.000 | 1.000 | 0.585 | 1.000 | 1.000 | 1.000 | 0.999 | 1.000 | 1.000 | 0.891 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.192 | 1.000 | 1.000 | 1.000 | 0.709 | 0.927 | 0.598 | 0.636 | 1.000 | 0.891 | 1.000 |
| days_from_first_collection | NaN | 1.000 | 1.000 | NaN | 0.784 | 0.009 | 0.000 | -0.030 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 | 0.784 | 0.000 | 0.000 | -0.028 | 0.364 | -0.023 | 0.997 | 0.000 | 0.000 | -0.001 | 0.077 | 0.077 | 0.540 | 1.000 | 1.000 |
| disease | 0.083 | 0.891 | 0.891 | 0.203 | 0.553 | 1.000 | 1.000 | 1.000 | 0.832 | 0.891 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.375 | 1.000 | 0.207 | 1.000 | 1.000 | 0.536 | 0.466 | 0.664 | 0.462 | 0.463 | 1.000 | 1.000 | 0.891 |
| disease_subtype | 0.000 | 1.000 | 1.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.214 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 |
| family_role | 0.000 | 1.000 | 1.000 | 0.000 | 0.992 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.784 | 1.000 | 0.000 | 1.000 | 1.000 | 0.000 | 1.000 | 0.494 | 1.000 | 1.000 | 0.000 | 0.000 | 0.109 | 0.240 | 0.237 | 0.482 | 1.000 | 1.000 |
| feeding_practice | 0.000 | 1.000 | 1.000 | 0.000 | 1.000 | 0.556 | 0.200 | 0.452 | 1.000 | 1.000 | 0.000 | 1.000 | 0.000 | 1.000 | 1.000 | 0.000 | 0.980 | 0.000 | 0.402 | 0.000 | 0.000 | 0.000 | 0.000 | 0.327 | 0.303 | 1.000 | 1.000 | 1.000 |
| fobt | 0.275 | 1.000 | 1.000 | 0.069 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.375 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.207 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.375 | 1.000 |
| formula_first_day | NaN | 1.000 | 1.000 | NaN | 1.000 | 0.128 | 0.343 | 0.406 | 1.000 | 1.000 | -0.028 | 1.000 | 0.000 | 1.000 | 0.980 | 0.000 | 1.000 | 0.333 | 0.059 | 0.000 | 0.000 | 0.276 | 0.130 | 0.129 | 0.130 | 1.000 | 1.000 | 1.000 |
| gender | 0.277 | 0.192 | 0.192 | 0.000 | 0.158 | 0.599 | 0.112 | 0.448 | 0.188 | 0.192 | 0.364 | 0.207 | 0.214 | 0.494 | 0.000 | 0.207 | 0.333 | 1.000 | 0.453 | 0.000 | 0.000 | 0.159 | 0.195 | 0.217 | 0.219 | 0.231 | 0.207 | 0.192 |
| gestational_age | NaN | 1.000 | 1.000 | NaN | 1.000 | 0.390 | 0.470 | -0.049 | 1.000 | 1.000 | -0.023 | 1.000 | 0.000 | 1.000 | 0.402 | 0.000 | 0.059 | 0.453 | 1.000 | 0.000 | 0.000 | 0.195 | -0.057 | 0.034 | 0.032 | 1.000 | 1.000 | 1.000 |
| infant_age | 0.000 | 1.000 | 1.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 | 0.997 | 1.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.061 | 0.119 | 0.300 | 0.293 | 1.000 | 1.000 | 1.000 |
| location | 0.159 | 1.000 | 1.000 | 0.155 | 0.286 | 0.000 | 0.000 | 0.000 | 0.987 | 1.000 | 0.000 | 0.536 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 1.000 | 0.167 | 0.123 | 0.000 | 0.536 | 1.000 |
| median_read_length | 0.000 | 0.709 | 0.709 | 0.000 | 0.372 | 0.282 | 0.000 | 0.000 | 0.579 | 0.709 | 0.000 | 0.466 | 1.000 | 0.000 | 0.000 | 0.000 | 0.276 | 0.159 | 0.195 | 0.061 | 0.000 | 1.000 | 0.615 | 0.324 | 0.353 | 0.000 | 0.466 | 0.709 |
| minimum_read_length | NaN | 0.927 | 0.927 | NaN | 0.525 | -0.056 | 0.000 | -0.089 | 0.755 | 0.927 | -0.001 | 0.664 | 1.000 | 0.109 | 0.000 | 1.000 | 0.130 | 0.195 | -0.057 | 0.119 | 1.000 | 0.615 | 1.000 | -0.623 | -0.628 | 0.000 | 0.664 | 0.927 |
| number_bases | -0.145 | 0.598 | 0.598 | 0.006 | 0.323 | 0.015 | 0.000 | 0.079 | 0.495 | 0.598 | 0.077 | 0.462 | 0.000 | 0.240 | 0.327 | 0.000 | 0.129 | 0.217 | 0.034 | 0.300 | 0.167 | 0.324 | -0.623 | 1.000 | 0.998 | 0.126 | 0.462 | 0.598 |
| number_reads | -0.145 | 0.636 | 0.636 | 0.005 | 0.321 | 0.012 | 0.000 | 0.078 | 0.518 | 0.636 | 0.077 | 0.463 | 0.000 | 0.237 | 0.303 | 0.000 | 0.130 | 0.219 | 0.032 | 0.293 | 0.123 | 0.353 | -0.628 | 0.998 | 1.000 | 0.133 | 0.463 | 0.636 |
| pregnant | 0.000 | 1.000 | 1.000 | 0.000 | 0.482 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.540 | 1.000 | 0.000 | 0.482 | 1.000 | 0.000 | 1.000 | 0.231 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.126 | 0.133 | 1.000 | 1.000 | 1.000 |
| study_condition | 0.083 | 0.891 | 0.891 | 0.203 | 0.553 | 1.000 | 1.000 | 1.000 | 0.832 | 0.891 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.375 | 1.000 | 0.207 | 1.000 | 1.000 | 0.536 | 0.466 | 0.664 | 0.462 | 0.463 | 1.000 | 1.000 | 0.891 |
| study_name | 1.000 | 1.000 | 1.000 | 1.000 | 0.585 | 1.000 | 1.000 | 1.000 | 0.999 | 1.000 | 1.000 | 0.891 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.192 | 1.000 | 1.000 | 1.000 | 0.709 | 0.927 | 0.598 | 0.636 | 1.000 | 0.891 | 1.000 |
Missing values
Sample
| study_name | sample_id | subject_id | body_site | antibiotics_current_use | study_condition | disease | age | infant_age | age_category | gender | country | non_westernized | sequencing_platform | DNA_extraction_kit | PMID | number_reads | number_bases | minimum_read_length | median_read_length | NCBI_accession | pregnant | lactating | curator | BMI | family | treatment | days_from_first_collection | family_role | born_method | feeding_practice | location | diet | travel_destination | visit_number | premature | birth_weight | gestational_age | antibiotics_family | disease_subtype | days_after_onset | creatine | albumine | hscrp | ESR | ast | alt | globulin | urea_nitrogen | BASDAI | BASFI | alcohol | flg_genotype | population | menopausal_status | lifestyle | body_subsite | uncurated_metadata | tnm | triglycerides | hdl | ldl | hba1c | change_in_tumor_size | RECIST | ORR | smoker | ever_smoker | dental_sample_type | history_of_periodontitis | PPD_M | PPD_B | PPD_D | PPD_L | fobt | disease_stage | disease_location | calprotectin | HBI | SCCAI | mumps | cholesterol | c_peptide | glucose | creatinine | bilubirin | prothrombin_time | wbc | rbc | hemoglobinometry | FMT_role | subcohort | fmt_id | remission | dyastolic_p | systolic_p | insulin_cat | adiponectin | glp_1 | cd163 | il_1 | leptin | fgf_19 | glutamate_decarboxylase_2_antibody | HLA | autoantibody_positive | age_seroconversion | age_T1D_diagnosis | hitchip_probe_class | previous_therapy | performance_status | toxicity_above_zero | PFS12 | fasting_insulin | fasting_glucose | protein_intake | stec_count | shigatoxin_2_elisa | stool_texture | anti_PD_1 | ajcc | smoke | bristol_score | hsCRP | LDL | mgs_richness | ferm_milk_prod_consumer | inr | birth_control_pil | c_section_type | hla_drb12 | hla_dqa12 | hla_dqa11 | hla_drb11 | zigosity | brinkman_index | alcohol_numeric | breastfeeding_duration | formula_first_day | ALT | eGFR | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | HanniganGD_2017 | MG100208 | HanniganGD_2017_A29 | stool | no | adenoma | adenoma | 45.0 | NaN | adult | female | CAN | no | IlluminaHiSeq | MoBio | 30459201 | 6133350 | 763336051 | 75 | 126 | SRR5665080 | NaN | NaN | Paolo_Manghi | 31.626276 | NaN | NaN | NaN | NaN | NaN | NaN | Toronto | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | no | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1 | HanniganGD_2017 | MG100207 | HanniganGD_2017_A28 | stool | no | adenoma | adenoma | 50.0 | NaN | adult | male | CAN | no | IlluminaHiSeq | MoBio | 30459201 | 9320348 | 1161633690 | 75 | 126 | SRR5665075 | NaN | NaN | Paolo_Manghi | 31.673469 | NaN | NaN | NaN | NaN | NaN | NaN | Toronto | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | no | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 2 | HanniganGD_2017 | MG100206 | HanniganGD_2017_A27 | stool | no | adenoma | adenoma | 68.0 | NaN | senior | male | CAN | no | IlluminaHiSeq | MoBio | 30459201 | 6342570 | 787897159 | 75 | 126 | SRR5665074 | NaN | NaN | Paolo_Manghi | 25.216253 | NaN | NaN | NaN | NaN | NaN | NaN | Toronto | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | no | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 3 | HanniganGD_2017 | MG100205 | HanniganGD_2017_A26 | stool | no | adenoma | adenoma | 80.0 | NaN | senior | female | CAN | no | IlluminaHiSeq | MoBio | 30459201 | 12551662 | 1562374319 | 75 | 126 | SRR5665073 | NaN | NaN | Paolo_Manghi | 28.719723 | NaN | NaN | NaN | NaN | NaN | NaN | Toronto | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | no | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 4 | HanniganGD_2017 | MG100204 | HanniganGD_2017_A25 | stool | no | adenoma | adenoma | 63.0 | NaN | adult | female | CAN | no | IlluminaHiSeq | MoBio | 30459201 | 15176232 | 1883682219 | 75 | 126 | SRR5665072 | NaN | NaN | Paolo_Manghi | 27.335640 | NaN | NaN | NaN | NaN | NaN | NaN | Toronto | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | no | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 5 | HanniganGD_2017 | MG100203 | HanniganGD_2017_A24 | stool | no | adenoma | adenoma | 67.0 | NaN | senior | female | CAN | no | IlluminaHiSeq | MoBio | 30459201 | 8180768 | 1017721164 | 75 | 126 | SRR5665079 | NaN | NaN | Paolo_Manghi | 25.558846 | NaN | NaN | NaN | NaN | NaN | NaN | Toronto | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | no | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 6 | HanniganGD_2017 | MG100202 | HanniganGD_2017_A23 | stool | no | adenoma | adenoma | 64.0 | NaN | adult | male | CAN | no | IlluminaHiSeq | MoBio | 30459201 | 5502292 | 682604286 | 75 | 126 | SRR5665078 | NaN | NaN | Paolo_Manghi | 25.057360 | NaN | NaN | NaN | NaN | NaN | NaN | Toronto | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | no | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 7 | HanniganGD_2017 | MG100201 | HanniganGD_2017_A22 | stool | no | adenoma | adenoma | 68.0 | NaN | senior | female | CAN | no | IlluminaHiSeq | MoBio | 30459201 | 7447616 | 925042865 | 75 | 126 | SRR5665077 | NaN | NaN | Paolo_Manghi | 31.588613 | NaN | NaN | NaN | NaN | NaN | NaN | Toronto | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | no | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 8 | HanniganGD_2017 | MG100200 | HanniganGD_2017_A21 | stool | no | adenoma | adenoma | 50.0 | NaN | adult | female | CAN | no | IlluminaHiSeq | MoBio | 30459201 | 2673274 | 331955334 | 75 | 125 | SRR5665076 | NaN | NaN | Paolo_Manghi | 23.828125 | NaN | NaN | NaN | NaN | NaN | NaN | Toronto | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | no | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 9 | HanniganGD_2017 | MG100199 | HanniganGD_2017_A20 | stool | no | adenoma | adenoma | 47.0 | NaN | adult | female | USA | no | IlluminaHiSeq | MoBio | 30459201 | 4172300 | 520248981 | 75 | 126 | SRR5665134 | NaN | NaN | Paolo_Manghi | 24.221453 | NaN | NaN | NaN | NaN | NaN | NaN | AnnArbor | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | no | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| study_name | sample_id | subject_id | body_site | antibiotics_current_use | study_condition | disease | age | infant_age | age_category | gender | country | non_westernized | sequencing_platform | DNA_extraction_kit | PMID | number_reads | number_bases | minimum_read_length | median_read_length | NCBI_accession | pregnant | lactating | curator | BMI | family | treatment | days_from_first_collection | family_role | born_method | feeding_practice | location | diet | travel_destination | visit_number | premature | birth_weight | gestational_age | antibiotics_family | disease_subtype | days_after_onset | creatine | albumine | hscrp | ESR | ast | alt | globulin | urea_nitrogen | BASDAI | BASFI | alcohol | flg_genotype | population | menopausal_status | lifestyle | body_subsite | uncurated_metadata | tnm | triglycerides | hdl | ldl | hba1c | change_in_tumor_size | RECIST | ORR | smoker | ever_smoker | dental_sample_type | history_of_periodontitis | PPD_M | PPD_B | PPD_D | PPD_L | fobt | disease_stage | disease_location | calprotectin | HBI | SCCAI | mumps | cholesterol | c_peptide | glucose | creatinine | bilubirin | prothrombin_time | wbc | rbc | hemoglobinometry | FMT_role | subcohort | fmt_id | remission | dyastolic_p | systolic_p | insulin_cat | adiponectin | glp_1 | cd163 | il_1 | leptin | fgf_19 | glutamate_decarboxylase_2_antibody | HLA | autoantibody_positive | age_seroconversion | age_T1D_diagnosis | hitchip_probe_class | previous_therapy | performance_status | toxicity_above_zero | PFS12 | fasting_insulin | fasting_glucose | protein_intake | stec_count | shigatoxin_2_elisa | stool_texture | anti_PD_1 | ajcc | smoke | bristol_score | hsCRP | LDL | mgs_richness | ferm_milk_prod_consumer | inr | birth_control_pil | c_section_type | hla_drb12 | hla_dqa12 | hla_dqa11 | hla_drb11 | zigosity | brinkman_index | alcohol_numeric | breastfeeding_duration | formula_first_day | ALT | eGFR | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 697 | YassourM_2018 | G102213 | M0038M | stool | NaN | control | healthy | NaN | NaN | adult | female | FIN | no | IlluminaHiSeq | PowerSoil | 30001517 | 34497674 | 3405400567 | 57 | 101 | SRR7281035 | no | NaN | Marisa_Metzger | NaN | YassourM_2018_M0038M | NaN | 167.0 | mother | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 698 | YassourM_2018 | G104686 | M0053C | stool | NaN | control | healthy | NaN | 0.0 | newborn | male | FIN | no | IlluminaHiSeq | PowerSoil | 30001517 | 951324 | 94442466 | 50 | 101 | SRR7281036 | no | NaN | Marisa_Metzger | NaN | YassourM_2018_M0053C | NaN | 0.0 | child | vaginal | NaN | NaN | NaN | NaN | NaN | NaN | 3120.0 | 41.4 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 699 | YassourM_2018 | G102217 | M0038C | stool | NaN | control | healthy | NaN | 60.0 | newborn | male | FIN | no | IlluminaHiSeq | PowerSoil | 30001517 | 33034210 | 3257723756 | 60 | 101 | SRR7281038 | no | NaN | Marisa_Metzger | NaN | YassourM_2018_M0038C | NaN | 60.0 | child | vaginal | mixed_feeding | NaN | NaN | NaN | NaN | NaN | 3620.0 | 39.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 246.0 | 2.0 | NaN | NaN |
| 700 | YassourM_2018 | G102218 | M0038C | stool | NaN | control | healthy | NaN | 90.0 | newborn | male | FIN | no | IlluminaHiSeq | PowerSoil | 30001517 | 42100416 | 4160467769 | 60 | 101 | SRR7281039 | no | NaN | Marisa_Metzger | NaN | YassourM_2018_M0038C | NaN | 90.0 | child | vaginal | mixed_feeding | NaN | NaN | NaN | NaN | NaN | 3620.0 | 39.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 246.0 | 2.0 | NaN | NaN |
| 701 | YassourM_2018 | G102211 | M0038M | stool | NaN | control | healthy | NaN | NaN | adult | female | FIN | no | IlluminaHiSeq | PowerSoil | 30001517 | 39973032 | 3948822776 | 60 | 101 | SRR7281040 | yes | NaN | Marisa_Metzger | NaN | YassourM_2018_M0038M | NaN | 0.0 | mother | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 702 | YassourM_2018 | G102212 | M0038M | stool | NaN | control | healthy | NaN | NaN | adult | female | FIN | no | IlluminaHiSeq | PowerSoil | 30001517 | 22249756 | 2189057656 | 76 | 100 | SRR7281041 | no | NaN | Marisa_Metzger | NaN | YassourM_2018_M0038M | NaN | 77.0 | mother | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 703 | YassourM_2018 | G104681 | M0024M | stool | NaN | control | healthy | NaN | NaN | adult | female | FIN | no | IlluminaHiSeq | PowerSoil | 30001517 | 61282548 | 6090258687 | 50 | 101 | SRR7281042 | yes | NaN | Marisa_Metzger | NaN | YassourM_2018_M0024M | NaN | 0.0 | mother | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 704 | YassourM_2018 | G102214 | M0038C | stool | NaN | control | healthy | NaN | 0.0 | newborn | male | FIN | no | IlluminaHiSeq | PowerSoil | 30001517 | 9321366 | 915709861 | 64 | 101 | SRR7281043 | no | NaN | Marisa_Metzger | NaN | YassourM_2018_M0038C | NaN | 0.0 | child | vaginal | mixed_feeding | NaN | NaN | NaN | NaN | NaN | 3620.0 | 39.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 246.0 | 2.0 | NaN | NaN |
| 705 | YassourM_2018 | G102215 | M0038C | stool | NaN | control | healthy | NaN | 14.0 | newborn | male | FIN | no | IlluminaHiSeq | PowerSoil | 30001517 | 25838128 | 2547202702 | 63 | 101 | SRR7281044 | no | NaN | Marisa_Metzger | NaN | YassourM_2018_M0038C | NaN | 14.0 | child | vaginal | mixed_feeding | NaN | NaN | NaN | NaN | NaN | 3620.0 | 39.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 246.0 | 2.0 | NaN | NaN |
| 706 | YassourM_2018 | G102216 | M0038C | stool | NaN | control | healthy | NaN | 30.0 | newborn | male | FIN | no | IlluminaHiSeq | PowerSoil | 30001517 | 33162504 | 3264202576 | 64 | 101 | SRR7281045 | no | NaN | Marisa_Metzger | NaN | YassourM_2018_M0038C | NaN | 30.0 | child | vaginal | mixed_feeding | NaN | NaN | NaN | NaN | NaN | 3620.0 | 39.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 246.0 | 2.0 | NaN | NaN |